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THE MEASUREMENT OF PERSONALITY* 


By Forrest H. Kirkpatrick 
Bethany College 


A discussion with special reference to the Bernreuter Personality 
Inventory and its use at Bethany College. 


INTRODUCTION 


Probably since the earliest days men have attempted to classify 
and describe their fellows and to explain those characteristics that 
mark one man as different from another. Various devices and 
techniques have been used from time to time by honest students in 
the field and by charlatans who have preyed upon the interest of 
men in this subject. 

Some psychologists, who have studied in this field, have usually 
put an emphasis upon the description and classification of the total 
personality. This concept seems to have come from the field of 
philosophy rather than the field of science which has long been 
approaching its problems in an atomistic fashion. Some have al- 
most assumed that personality is something simple and static. We 
have been charged to remember “the whole man.” 

With the growing realization that personality is complex and 
dynamic, the attempts to describe it and classify it have gone along 
on more modest lines. Most psychologists today are content to 
study and describe single aspects of personality. Only in the case 
study method do we think of studying “the whole man,” and the 
concept of personality as a “unitary something” no longer survives. 
Psychologists now think about the measurement and study of 
“traits” and “manifestations” of personality. There is, of course, 
a constant danger in this procedure for too much importance may 
be attached to a single aspect, and this may obscure the greater 
significance of the total integrated personality. It has been fruit- 
ful, however, in that it has made possible an empirical attack on 
these important problems. 

Within the last decade some attention has been given to the 
matter of devising tests for the measurement of personality traits. 


* Recommended for publication by Dr. J. R. Kantor, September 5, 1939. 
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The early work of Woodworth and the development of a psycho- 
neurotic inventory (“Personal Data Sheet”) ‘”’ paved the way in 
terms of technique and encouragement for many investigations. 
Some of the others who followed in this field were House’, 
Laird”, Freyd“’, Pressey’, Allport”, Thurstone®’, Bernreu- 
ter’, Symonds’, Cowan”, Pintner’”’, and Guilford”. Thurs- 
tone’s “Personality Schedule” was largely adapted from the earlier 
work in this field. It has certain weaknesses that have been pointed 
out rather clearly.“” Symonds has undertaken to measure “ad- 
justment” and his tests are more definite departures from what 
has been done by some of the others. An eight page inventory set 
up to be administered as a group test and designed to reveal mal- 
adjusted pupils in Grades 4 to 9 was published by Pintner and 
associates in 1938. Cowan’s “Adolescent Personality Schedule” 
is modeled after the Thurstone schedule. “The Nebraska Person- 
ality Inventory” by the Guilfords aims to measure three of the 
five factors obtained from a factorial analysis of the responses to 
thirty-six questions of the type which have been traditionally used 
to diagnose tendencies toward introversion or extroversion. The 
growing interest in personality measurement is revealed by the in- 
creasing number of publications in this field and the place it has 
assumed in educational and psychological study and discussion. 
It is not certain, however, that the actual measurement of per- 
sonality has gotten under way even now. Terman’ maintains 
that at present we are not measuring personality but responses 
which reveal certain aspects of personality. Further analysis should 
be made of the significance of the variations in responses to specific 
items as well as total test. He criticizes measurement enthusiasts 
who lose sight of the dangers inherent in the reduction of psy- 
chological data by mathematical tools. He asks the question, “Is 
it really possible to measure personality in the strict sense of the 
word ‘measure’ as the physicist, for example, is accustomed to use 
the term?” Terman feels that the clinical approach is absolutely 
necessary for the interpretation of the personality as a whole. 
Hendrickson“ in a discussion of the assumptions underlying per- 
sonality measurement considers the nature and stability of personal- 
ity as well as the social factors involved. Vernon” in writing on 


“The Measurement of Personality and Temperament” says that a 
temperament or personality trait can not be defined solely in terms 
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of objective behavior, but is dependent upon observation and inter- 
pretation of such behavior by human minds. He feels that a single 
test, or set of ratings, can not give an adequate measure of a trait; 
instead a variety of diverse tests should be combined into a positive 
score. 

In this discussion no mention has been made of the development 
and use of instruments to measure psychogalvanic reflex and other 
physiological indications of attitude, the studies in constitution and 
its significance for typology, or the special clinical methods. These 
are certainly “measurements of personality” and: are omitted be- 

_cause of the severe restrictions of this paper. It is interesting to 
note that in a review of the literature from January 1932 to Janu- 
ary 1935 Olson“*’ found 161 titles covering reports of studies treat- 
ing: (a) contributions to technique, including systematic observa- 
tions, ratings, tests, expressive movements, psychogalvanic studies, 
physiological studies, inventories, factor analyses; (b) characteris- 
tics, traits and constellations of behavior, including delinquency, 
character and personality scales, developmental ages, emotions, eat- 
ing behavior, friendship and quarreling, honesty, humor, leadership 
and popularity, negativism, “only” children, play, recklessness, so- 
cial and ethical information, socio-economic factors, studiousness, 
and vocalization; (c) the patterning of character and personality 
measures; (d) the modification of informatién and conduct includ- 
ing direct and indirect instruction, motivation, and the effect of or- 
ganization membership. 


BERNREUTER INVENTORY 


While working at Leland Stanford University, Robert G. Bern- 
reuter constructed a test which he entitled “Personality Inventory.” 
It was first described in the psychological literature in 1933“ al- 
though Symonds “’ mentions it in his book “Diagnosing Personal- 
ity and Conduct” which was published in 1931. 

The Personality Inventory was designed at first to estimate those 
traits which have been termed neurotic tendency, introversion-ex- 
troversion, ascendance-submission, and self-sufficiency. It was con- 
structed “to determine the feasibility of estimating more than a 
single personality trait at one time,” and according to Bernreuter’” 
it “assumes that the integrated behavior of an individual in any 
situation may be interpreted from various points of view.” It might 


« 
ee, 
2 


214 FORREST H. KIRKPATRICK 


be said that this is an integration of the various questionnaires and 
schedules which preceded Bernreuter’s work. His first inventory 
contains 125 questions which are to be answered by encircling one 
of three responses, i.e. yes, no, and ?, and it is to be scored in four 
ways to yield (a) a measure of neurotic tendency, (b) a measure 
of self-sufficiency, (c) a measure of introversion-extroversion, (d) 
a measure of dominance-submission. It is largely because it was 
advertised as a “four-in-one” measure that this Inventory has been 
used so widely. Reliability coefficients were reported for college 
psychology classes shortly after the test was issued, all of which 
were over .85. Reliability was established by the split-half method. 

The four items on the Bernreuter were validated by correlating 
them against (a) Thurstone’s Personality Schedule, (b) the Bern- 
reuter Self-Sufficiency Test, (c) the Laird C-2 Introversion Test, 
(d) the Allport Ascendance-Submission Reaction Study. The 
reported coefficients of correlation, when correlated for attenua- 
tion, are all over .90 (with one exception), and most of them are 
close to 1.00. The coefficients of validity as reported are: 


Group A Group B 
Uncorr. Corr. Uncorr. Corr. 
.94 1.00 91 99 
76 99 .69 .92 
B4D and AS (men) .............. 81 1.00 67 84 
B4D and AS (women) .......... -... 82 . 99 


In concluding a statement on “Validity of the Personality Inven- 
tory” Bernreuter said: “It now seems apparent that scores ob- 
tained under standard conditions are not due merely to chance; 
neither do they represent attempts to gain social approval; nor do 
they indicate what subjects would like to be like.” 

In checking on the items of the Bernreuter Inventory we found 
that a great many of the question items were evidently taken from 
tests used in validating the Inventory. There are 50 Bernreuter 
items similar to 50 items in the Thurstone schedule, and around 
30 items in common with items from the Allport A.S. test. These 
findings would suggest that the high validity coefficients reported 
by Bernreuter are to a large degree spurious. 
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The exact questions in the Inventory follow the Thurstone pat- 
tern very closely. Such questions as the following are used: 


1. Does it make you uncomfortable to be “different” or uncon- 
ventional? 

2. Are you troubled with shyness? 

3. Do you very much mind taking back articles you have pur- 
chased at stores? 

4. Do you often experience periods of loneliness? 

5. If you are dining out do you prefer to have someone else 
order for you? 

6. Do you usually ignore the feelings of others when accom- 
plishing some end which is important to you? 

7. Do you often find that you cannot make up your mind until 
the time for action is passed? 

8. Do you especially like to have attention from acquaintances 
when you are ill? 

9. Do you experience many pleasant or unpleasant moods? 

10. Are you troubled with feelings of inferiority? 

11. Does some particularly useless thought keep coming into your 
mind to bother you? 

12. Do you ever upbraid a workman who fails to have your work 
done on time? 

13. Are you able to play your best in a game or contest against 
an opponent who is greatly superior to you? 

14. Do you usually avoid asking advice? 

15. If you are spending an evening in the company of other peo- 
ple do you usually let someone else decide upon the entertain- 
ment? 

16. Do you take the responsibility for introducing people at a 
party? 

17. If you come late to a meeting would you rather stand than 
take a front seat? 

18. Does your mind often wander so badly that you lose track of 
what you are doing? 

19. Do you find that people are more stimulating to you than 
anything else? 

20. Do your feelings alternate between happiness and sadness 


without apparent reason? 
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The Bernreuter Inventory has enjoyed great popularity and many 
persons have experimented with it. The findings from all of the 
various experiments could not be summarized here. It is possible 
to mention only the a which represent types of studies 

undertaken. 


(a) Johnson” studied the effect of mood on personality traits 
as measured by Bernreuter using 15 college women. He 
determined mood by a word list to which they responded. 
Small differences in Bernreuter were found for the two 
moods. 

(b) Darley & Ingle“ administered the Inventory to employed 
and unemployed workers. They found it effective in iden- 
tifying those individuals who had been previously diagnosed 
as psychotic cases. 

(c) Landis & Katz’, who gave Bernreuter’s and Thurstone’s 
inventories to psychiatric patients, concluded that “the scores 
on the tests are not adequate measures of neurotic tendency, 
so far as psychiatric patients are concerned.” 

(d) Laslett & Bennett” correlated the scores on Bernreuter’s 
Inventory with scores on the Kent-Rosanoff clinical test of 
free association, and found no relationship between the two. 
They concluded that “the two tests do not measure the same 
thing or they do not measure it in the same way.” 

(e) Yu” reports that the difference between manic-depressive 
and dementia praecox cases on the Inventory was too slight 
“to warrant its use as a diagnostic tool.” 

(f) Omwake et al” found no relationship between scores on 
the Inventory and physiological tests of metabolism, blood 


pressure, etc. 


Bernreuter’s basic contribution to the measurement of personal- 
ity is that of regarding the subject’s response to each item of a test 
as an indication of several types of adjustment, of several modes of 
reaction, or of several traits. The Inventory was first intended to 
be what was referred to above as a “four-in-one” measure but he 
has revised it until he now claims that it can be used to appraise 
six different personality traits. Bernreuter was the first person to 
apply this principle of multiple scoring to personality tests, although 
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both Cowdery and Strong have made use of the principle in the . 
measurement of “interests.” 


BETHANY REsuULTs 


For a few years we used the Bernreuter Inventory at Bethany 
College to study the persistence of the personality traits through 
the first two years of the college experience but we have not been 
able to continue the study during the last four years. It was as- 
sumed that the Inventory gives an evaluation of personality traits 
and that the traits which it purports to measure, i.e. (a) neurotic 
tendency, (b) self-sufficiency, (c) introversion, (d) dominance, 
are independent. 

Bethany freshmen were given the Bernreuter in the early fall 
of 1931, 1932, and 1933. These same students were again tested 
by the Bernreuter near the end of their sophomore year, i.e., May 
1933, May 1934, and May 1935. The number of students for 
which we have full reports are as follows: 


Class entering 1931: 48 
Class entering 1932: 46 
Class entering 1933: 46 


Our study is based on these three groups. 
For the first class the means for each trait, as computed from 
Bernreuter’s established percentiles at that time, are as follows: 


Trait Average Score 
51.7083 
49.0417 
46.9167 

For this same class, as of May 1933, the means are as follows: 

Trait Average Score 
49.3542 
48.1458 


There is not much to learn from a study of the averages of the 
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_ testings, but the correlation of the individual scores on these traits 
gives some indication as to the persistence of these traits through 
the freshman and sophomore year. The correlation ratio was ob- 
tained by the Pearson product-moment method. The results for the 
three classes are listed in the table below. 


CORRELATION BETWEEN SCORES OF COLLEGE STU- 
DENTS TESTED AT BEGINNING OF FRESHMAN YEAR 
AND AT END. OF SOPHOMORE YEAR 


Class entering Class entering Class entering 


Trait in 193] in 1932 in 1933 
Neurotic tendency .7515 .7567 .6108 
Self-sufficiency -......... .7822 
Introversion .............- 7444 .7253 .7042 
Dominance ................ .7668 .6573 .7054 


Except in the scores on “self-sufficiency” in the class which en- 
tered in 1932, the traits show a fairly high persistence. At least we 
know there is a fairly high relationship in the scores obtained from 
the two testings. The fact that the correlation is no higher would 
suggest that there are personality changes going on in the college 
experience. Minor alterations in personality within the same indi- 
vidual are common, and we would expect changes of some kind 
through those late adolescent years. It is likely that they would 
be perceptible changes—and with such an instrument as this, meas- 
urable changes. We think of personality as in something of a state 
of flux all of the time. 

The deeper questions as to whether these changes are toward a 
more wholesome personality, how much the college contributes to 
personality development, and the illumination and correction of 
problems that prevent good adjustment quickly suggest themselves, 
but not the answers. 


From the standpoint of the personnel officer and instructor, the 
correlation treatment yields only limited conclusions. The person 
who deals directly with the student is concerned with the way the 
scores shift and in what direction that shift is made. By tabulating 


MEASUREMENT OF PERSONALITY 219 


and studying the changes that occurred in individual scores, it is 
possible to make some interesting observations. 

A partial analysis of the scores from the class entering in 1931 
is included to show the number of students whose scores shifted 
one way or the other and the extent of the shift in the two college 
years. The data are as follows: 


Changed 1 to 10 Changed over 10 
points in percen- points in percen- 


Total tile rank tile rank 
NEUROTIC TENDENCY 
Increase .............. 18 11 7 
Decrease ............. 29 13 16 
Unvaried ............ l 
SELF-SUFFICIENCY 
.............- 28 18 10 
Decrease .............- 16 8 8 
Unvaried ............ 4 
INTROVERSION 
Increase .............. 18 7 11 
Decrease .............. 26 12 14 
Unvaried ............ 4 
DOMINANCE 

Increase .............- 36 18 18 
Decrease .............. 8 5 3 
Unvaried ............ 4 


An interesting observation brought out by this tabulation is best 
shown by presenting the data as follows: 


Direction of 


Trait Greatest Number of Changes 
Neurotic tendency ..............-.---2----+-- Decrease (9) 
Increase (12) 
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In each trait measured, the majority of individuals show a 
change in the “better” direction. In other words, the scores of nine 
more individuals decreased in neurotic tendencies than increased, 
twelve more increased in self-sufficiency than decreased, eight more 
decreased in tendency to introversion than increased, and twenty- 
eight more increased in tendency to dominate than decreased. It is 
not too much to assume that, in this group, the majority seem to 
have profited in terms of “wholesome personality” from their col- 
lege experiences. There is evidence, of course, that some students 
made poorer adjustment. It is especially interesting that in the 
dominance trait, the number with increased scores is larger than anv 
other. Here the number of individuals with increased scores ex- 
ceeds the number with decreased scores by an amount equal to one 
less than the sum of the differences in the other three combined. 
Perhaps the dominance trait has its “flowering” in that sophomore 


year. 


RELIABILITY 


The correlations reported at Bethany College between student 
scores on the Bernreuter in September and the scores on the same 
test several months later might be considered as “coefficients of 
reliability” as determined by the re-test method. In the light of 
what we know of the dynamics of personality, this would not be a 
wise suggestion. As mentioned above, Bernreuter determined relia- 
bility by the split-half method, correcting this coefficient by the 
Brown-Spearman formula. This is actually only a measure of in- 
ternal consistency, but that in itself is a most important considera- 
tion. He reported reliability coefficients as follows: 


Trait Group A Group B 
Self-suffitiency 92 85 


A few of the typical reliabilities of questionnaires of this kind, 
taken from the experimental literature, are as follows: 
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Method of 
determining Coefficient of 
Test Reported by reliability _ reliability 
Woodworth Psycho. Mathews Split-half 90 


Woodworth-Mathews § Mathews Split-half 667 


Woodworth-Mathews Mathews Retest 697 
Colgate, M.H. Hoitsma Retest 85 
Colgate, M.H. Hoitsma Split-half 79 
Heidbreder, I.E. Heidbreder f-associates 

ratings 
Marston, I.E. Marston Split-half 89 
Cowdery Interest Cowdery Split-half 827 
Mental Hyg. Inven. House Retest 85 
Thurstone Person. Thurstone Split-half .946 
Adjustment Question- | Symonds Split-half .90 

naire 
SUMMARY 


In the field of personality measurement, the time is not ripe for 
wide generalizations and conclusions. We are at the place where 
we need to face many critical questions. Terms need to be more 
closely defined. “Personality” itself needs to be defined in more 
specific language. Just what we are attempting to measure is not 
always clear. Are we measuring the exact status of an individual’s 
behavior or are we measuring the probability of action by that indi- 
vidual in one direction or another? Perhaps we measure both. 

That great progress has been made in this field is evident. The 
instruments now available are making valuable contributions in per- 
sonnel and psychological work. These instruments possess only lim- 
ited diagnostic value, and they can not be expected to take the place 
of the personal interview and case study. It is not unlikely that per- 
sonality tests will be used even more widely than intelligence tests 
in our social, penal, and educational agencies. The adjustment 
status of individuals is at the root of many problems and issues. 

The Bethany study with the Bernreuter gives the first re-test cor- 
relations so far reported at the college level. The ratios obtained 
indicate some persistence in the traits measured, but for a more 
complete understanding we must wait until we have techniques to 
equate conditions of environment, general stimulation, and many 
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other factors. One of the recurring questions with the Bernreuter 
is whether it actually measures the four traits independently. After 
making a statistical study of the scores made by a group of adults, 
Lorge"”’ suggests that probably not more than two independent 
traits can be used to account for the “traits” of the Inventory. 
Lorge writes: “Personality traits can not be created by the psy- 
chologist. If the concept of personality is to have meaning, it must 
be conceived as an aspect of the individual—an aspect susceptible 
to quantification. Naming a trait does not make it a trait. To be 
useful in psychology, a trait must be regarded as freed from the 
influences of other traits, and other aspects of the individual. The 
separation of neurotic tendency from introversion-extroversion is a 
desirable goal, but it is a goal not easily achieved by consideration 
of measurements that correlate too highly. Purity of trait from the 
point of view of consistency of measurement, from the point of 
view of reliability of measurement, and from the point of view of 
independence of measurement, is the only hope in the all too elusive 
field of personality diagnosis.” 


BIBLIOGRAPHY 


1. Allport, G. W. A test for ascendance-submission, J. Abn. 
& Soc. Psychol., 1928, 23, 118-136. 

2. Bernreuter, R. G. Validity of Personality Inventory, Per- 
sonnel J., 1933, 11, 383-386. 

3. Bernreuter,R.G. Theory and construction of the personal- 
ity inventory, J. Soc. Psychol., 1933, 4, 387-405. 

4. Cowan, E. A., and others. An adolescent personality sched- 
ule, Child Development, 1935, 6, 77-87. 

5. Darley, J. G., and Ingle, D. J. An Analysis of the Bern- 
reuter Personality Inventory in Occupational Guidance, 
Bull. Employment Stabilization Res. Inst., Univ. of Minn., 
1934, 3, 32-41. 

6. Freyd, M. Introverts and extroverts, Psychol. Rev., 1924, 
31, 74-87. 

7. Guilford, J. P., and R. B. Personality Factors S, E, and 
M, and Their Measurement, J. of Psychol., 1936, 2, 109- 
127. 


17. 


18. 


19. 


20. 


21. 


22. 


MEASUREMENT OF PERSONALITY 223 


Hendrickson, G. Some Assumptions Involved in Personal- 
ity Measurement, J. Exper. Educ., 1934, 2, 243-249. 

House, S. D. A mental hygiene inventory, Arch. Psychol., 
1927, 14 No. 88, p. 112. 

Johnson, W.B. The effect of mood on personality traits as 
measured by Bernreuter, J. Soc. Psychol., 1934, 5, 512- 
522. 

Kelley, T. L. Interpretation of educational measurements, 
Yonkers, N. Y., World Book Co., 1927, p. xiii-363. 

Kirkpatrick, F. H. The Validity of the Thurstone Person- 
ality Schedule, Proc. W. Va. Acad. Sci., May, 1936. 

Laird, D. A. Detecting abnormal behavior, J Abn. & Soc. 
Psychol, 1925, 20, 128-141. 

Landis, C., and Katz, S. E. The Validity of Certain Ques- 
tions Which Purport to Measure Neurotic Tendencies, J. 
Appl. Psychol., 1934, 18, 343-356. 

Laslett, H. R., and Bennett, E. A Comparison of Scores 
on Two Measures of Personality, J. Abn. & Soc. Psychol., 
1934, 28, 459-461. 

Lorge, I. Personality Traits by Fiat, J. of Educ. Psychol., 
1935, 26, 273-278. 

Olson, W. C. Measures of character and personality through 
conduct and information, Rev. Educ. Res., 1935, 5, 273- 
290. 

Omwake, K. T., Dexter, E. S., and Lewis, L. W. The Inter- 
relations of Certain Physiological Measurements and As- 
pects of Personality, Character and Personality, 1934, 3, 
64-71. 

Pintner, Rudolf, Leftus, J. J., Forlarno, and Alster, B.— 
Aspects of Personality, World Book Co., 1938. 

Pressey, S. L. A group scale for investigating the emotions, 
J. of Abn. & Soc. Psychol., 1921, 16, 55-64. 

Symonds, P. M. Diagnosing personality and conduct, New 
York, D. Appleton & Co., 1931. 

Symonds, P. M., and Jackson, C. E. Measurement of Per- 
sonality Adjustments of High School Pupils, New York, 
Bureau of Publications, Teachers College, Columbia Uni- 
versity, 1935. 


9. 
10. 
12. 
13. 
15. 
16. 
|_| 
= 
|| 


FORREST H. KIRKPATRICK 


Terman, L. M. The measurement of personality, Science, 
1934, 80, 605-608. 

Thurstone, L. L., and T. G. A neurotic inventory, J. Soc. 
Psychol., 1930, 1, 3-30. 

Vernon, P. E. The measurement of personality, Human 
Factor, London, 1934, 8, 87-95. 

Woodworth, Robert S. (Revised by Vernon M. Cady) 
Personal Data Sheet, published by C. H. Stoelting & Co. 

Yu, P. E. A Personality (Bernreuter) Study of Two 
Groups of Psychotics, Psychol. Bull. 1934, 31, 586-587 
(Abstract) . 


224 
23. 
24. 
26. 
27. 
] 


4 
| 
| 
i 
= 
4 
| 


